Corpus: msa-in_web_2015_30K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 90 91 93 93 99
1000 760 938 966 971 992
10000 4957 7778 8480 8672 9030
100000 11632 22278 25508 26456 27425
1000000 11632 22278 25508 26456 27425


Zipf's diagram for sentence endings


Gnuplot diagram

5209 msec needed at 2018-05-25 22:10